Using Google to Create a More Accurate and Easily-Extensible Spell Corrector
Abstract
Spell checkers are now a common, integrated part of many commercial and freely available word processing programs. Agglutinative languages (such as Hungarian and Finnish) pose a separate problem, as there are many different "correct" forms for any given word. Because the number of possible words is seemingly infinite, the limited scope of the dictionary provided with most spell-checking software poses an obvious problem for completeness, and not only in terms of computability. In this paper, we explore means of isolating and recommending corrections for misspelled words in English using the Internet as a corpus, and then discuss methods of extending these processes to another language, specifically the agglutinative Hungarian.
Similar resources
Context-sensitive Spelling Correction Using Google Web 1T 5-Gram Information
In computing, spell checking is the process of detecting and sometimes providing spelling suggestions for incorrectly spelled words in a text. Basically, a spell checker is a computer program that uses a dictionary of words to perform spell checking. The bigger the dictionary, the higher the error detection rate. Because spell checkers are based on regular dictionaries, they suffer ...
Analysis of the Spell of Rainy Days in Lake Urmia Basin using Markov Chain Model
In this study, the frequency and spell of rainy days were analyzed in the Lake Urmia basin using a Markov chain model. For this purpose, daily precipitation data from 7 synoptic stations in the Lake Urmia basin were used for the period 1995-2014. The daily precipitation data at each station were classified into wet and dry states, and the fitness of a first-order Markov chain on the data series was e...
Design and Implementation of a Spell Checker for Hausa Language (Étude et conception d'un correcteur orthographique pour la langue haoussa) [in French]
In this paper, we have designed, implemented and tested a spell corrector for the Hausa language, which is the second most spoken language in Africa and does not yet have processing tools. This study is a contribution to the automatic processing of the Hausa language. We used existing techniques for other languages and adapted them to the special case of the Hausa language. The corrector designed ...
Review of Oral Saliva Measurement Standard Methods
Background and Aim: At present, there are various methods for performing diagnostic tests. Because saliva is easily available compared to other body fluids, can be collected from patients non-invasively, and carries no risk of cross-infection, its use has become more common. Also, the analysis of substances in saliva can be a reflection of a person's oral...
Presenting a Ranker for a Semantic Error Checker Using Context-Sensitive Features
Nowadays, a large volume of documents is generated daily. These documents are generated by different people and thus contain spelling errors, which decrease the quality of the documents. Therefore, automatic writing assistance tools such as spell checkers/correctors can help to improve their quality. Context-sensitive errors are misspelled words that have been...